The Interactive Systems Labs View4You Video Indexing System

Authors

  • Thomas Kemp
  • Petra Geutner
  • Michael Schmidt
  • Borislav Tomaz
  • Manfred Weber
  • Martin Westphal
  • Alexander H. Waibel
Abstract

The recognition of broadcast news is a challenging problem in speech recognition. To achieve the long-term goal of robust, real-time news transcription, several problems have to be overcome, e.g. the variety of acoustic conditions and the unlimited vocabulary. Recently, a number of sites have been working on content-addressable multimedia information sources. In this paper, we focus on extending this work towards a multilingual environment, where queries and multimedia documents may appear in multiple languages. In cooperation with the Informedia project at CMU [4], we attempt to provide cross-lingual access to German and Serbo-Croatian newscasts.

1. THE VIEW4YOU SYSTEM

In the View4You system, German and Serbo-Croatian public newscasts are recorded daily using standard consumer electronics equipment. The newscasts are automatically segmented, and an index is created for each segment by means of automatic speech recognition. The user can query the system in natural language. The system returns a list of segments sorted by relevance to the user query. By selecting a segment, the user can watch the corresponding part of the news show on his or her computer screen. In this work, we give an overview of the three main parts of the View4You system, namely the segmenter, the speech recognizer, and the information retrieval engine.
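The query step described above (segments indexed by their recognized text, then returned in order of relevance) can be illustrated with a minimal sketch. The abstract does not say which retrieval model View4You uses, so the TF-IDF-style scoring below is an assumption chosen for brevity, and the segment ids, transcripts, and function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def build_index(segments):
    """Return per-segment term counts and per-term document frequencies."""
    term_counts = {seg_id: Counter(tokenize(text)) for seg_id, text in segments.items()}
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())
    return term_counts, doc_freq


def rank_segments(query, term_counts, doc_freq):
    """Return ids of segments matching the query, best match first.

    Scoring is a simple TF-IDF sum (an assumption, not the paper's method).
    """
    n_docs = len(term_counts)
    scores = {}
    for seg_id, counts in term_counts.items():
        score = 0.0
        for term in tokenize(query):
            if term in counts:
                idf = math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0
                score += counts[term] * idf
        if score > 0:
            scores[seg_id] = score
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical recognizer output for three news segments.
segments = {
    "seg1": "the chancellor spoke about the budget in parliament",
    "seg2": "heavy rain caused flooding in the south",
    "seg3": "parliament debated the flooding relief budget budget",
}
term_counts, doc_freq = build_index(segments)
ranking = rank_segments("budget parliament", term_counts, doc_freq)
```

In a real system the transcripts would come from the speech recognizer and therefore contain errors, so the retrieval engine has to tolerate an imperfect index; this sketch ignores that issue.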

Similar resources

Reducing the OOV rate in broadcast news speech recognition

Thomas Kemp, Alex Waibel. Interactive Systems Laboratories, ILKD, University of Karlsruhe, 76128 Karlsruhe, Germany. Abstract: The recognition of broadcast news is a challenging problem in speech recognition. To achieve the long-term goal of robust, real-time news transcription, several problems have to be overcome, e.g. the variety of acoustic conditions and the unlimited vocabulary. In this paper w...

Evaluating different information retrieval algorithms on real-world data

More and more data is produced in the form of videos, which are opaque to textual queries. To allow searching in video data collections, two problems have to be solved: The automatic generation of a searchable index, and the effective search in the automatically produced and therefore imperfect index. The ISL View4You system is a prototype of a video indexing and retrieval system which both gen...

Components and systems for interactive video indexing

The process of video indexing determines the quality of video retrieval. We present a modularization of indexing systems in which dependencies of components are made explicit. We stress the impact of human interaction in the architectural scheme, as the semantic gap between automatic abstractions and semantic indices requires human intervention. We discuss the components for efficient indexing,...

Believable Visual Feedback in Motor Learning Using Occlusion-based Clipping in Video Mapping

Gait rehabilitation systems provide patients with guidance and feedback that assist them to better perform the rehabilitation tasks. Real-time feedback can guide users to correct their movements. Research has shown that the quality of feedback is crucial to enhance motor learning in physical rehabilitation. Common feedback systems based on virtual reality present interactive feedback in a monit...

The France Telecom Orange Labs (Beijing) Video Semantic Indexing Systems - TRECVID 2010 Notebook Paper

In this paper, we describe the latest video semantic indexing systems developed at France Telecom Orange Labs (Beijing). In our previous systems for TRECVID 2009, the features of color, edge, texture and SIFT were used. This year, some new features based on local descriptors were added for performance improvement. Three Full runs (130 concepts) based on later fusion and one Light run (10 conce...

Publication date: 1998